Algorithm Application Support Vector Machine with Genetic Algorithm Optimization Technique for Selection Features for the Analysis of Sentiment on Twitter
نویسنده
چکیده
Twitter has become one of the most popular micro-blogging platform, recently. Millions of users can share their thoughts and opinions about various aspects and activites. Therefore, twitter considered as a rich source of information for decision-making and sentiment analysis. In this case, the sentiment is aimed to overcome the problem of automatically classifying user tweets into positive opinion and negative opinion. The classifier Support Vector Machine (SVM) used in this study is a machine learning technique that is popular text classifiers, as Support Vector Machine (SVM) algorithm is one that has a linear calcification of the main principles for determining the linear separator in the search space that can best separate the two classes different. But the Support Vector Machine (SVM) has the disadvantage that the appropriate parameter selection problem. The tendency in recent years is to simultaneously optimize the features and parameters for Support Vector Machine (SVM), so as to improve the accuracy of classification on Support Vector Machine (SVM). Genetic Algorithm has the potential to produce better features and becomes optimal parameters at the same time. This research generate text classification in the form of positive and negative tweets on twitter. Measurement accuracy is based on Support Vector Machine (SVM) before and after using a Genetic Algorithm. Evaluation was performed using 10 fold cross validation while accuracy is measured by the confusion matrix and ROC curves. The results of the study showed an increase in accuracy of Support Vector Machine (SVM) from 63.50% to 93.50%.
منابع مشابه
Feature Selection Using Multi Objective Genetic Algorithm with Support Vector Machine
Different approaches have been proposed for feature selection to obtain suitable features subset among all features. These methods search feature space for feature subsets which satisfies some criteria or optimizes several objective functions. The objective functions are divided into two main groups: filter and wrapper methods. In filter methods, features subsets are selected due to some measu...
متن کاملForecasting Stock Price Movements Based on Opinion Mining and Sentiment Analysis: An Application of Support Vector Machine and Twitter Data
Today, social networks are fast and dynamic communication intermediaries that are a vital business tool. This study aims at examining the views of those involved with Facebook stocks so that we can summarize their views to predict the general behavior of this stock and collectively consider possible Facebook stock price movements, and create a more accurate pattern compared to previous patterns...
متن کاملIntelligent application for Heart disease detection using Hybrid Optimization algorithm
Prediction of heart disease is very important because it is one of the causes of death around the world. Moreover, heart disease prediction in the early stage plays a main role in the treatment and recovery disease and reduces costs of diagnosis disease and side effects it. Machine learning algorithms are able to identify an effective pattern for diagnosis and treatment of the disease and ident...
متن کاملApplication of Genetic Algorithm Based Support Vector Machine Model in Second Virial Coefficient Prediction of Pure Compounds
In this work, a Genetic Algorithm boosted Least Square Support Vector Machine model by a set of linear equations instead of a quadratic program, which is improved version of Support Vector Machine model, was used for estimation of 98 pure compounds second virial coefficient. Compounds were classified to the different groups. Finest parameters were obtained by Genetic Algorithm method ...
متن کاملHeart Rate Variability Classification using Support Vector Machine and Genetic Algorithm
Background: Electrocardiogram (ECG) is defined as an electrical signal, which represents cardiac activity. Heart rate variability (HRV) as the variation of interval between two consecutive heartbeats represents the balance between the sympathetic and parasympathetic branches of the autonomic nervous system.Objective: In this study, we aimed to evaluate the efficiency of discrete wavelet transfo...
متن کامل